Rethinking generalization requires revisiting old ideas: statistical mechanics approaches and complex learning behavior
Abstract
We describe an approach to understand the peculiar and counterintuitive generalization properties of deep neural networks. The approach involves going beyond worst-case theoretical capacity control frameworks that have been popular in machine learning in recent years to revisit old ideas in the statistical mechanics of neural networks. Within this approach, we present a prototypical Very Simple Deep Learning (VSDL) model, whose behavior is controlled by two control parameters, one describing an effective amount of data, or load, on the network (that decreases when noise is added to the input), and one with an effective temperature interpretation (that increases when algorithms are early stopped). Using this model, we describe how a very simple application of ideas from the statistical mechanics theory of generalization provides a strong qualitative description of recently-observed empirical results regarding the inability of deep neural networks not to overfit training data, discontinuous learning and sharp transitions in the generalization properties of learning algorithms, etc.
Similar resources
The Effect of Electronical Media on the Reinforcement of Social Behavior of Youth from the Computer Course Professors and Students Viewpoints of Sari Islamic Azad University
The goal of this research was to examine the effect of electronic learning media on the reinforcement of youth social behavior from the point of view of computer course professors and students of Islamic Azad University of Sari. The statistical population included all computer students and professors of I.A.U. of Sari. The statistical sample was identified using the sample content identification ...
Big Data: Rethinking Text Visualization
In this white paper we discuss text visualization approaches and how these are important for text analytics as done with the KMX technology of Treparel. Text visualization is most powerful when it supports understanding complex patterns in data and supports decision making. Statistical and machine learning techniques are used to find patterns and relationships that can then be visualized. Class...
Empirical Risk Minimization Versus Maximum-Likelihood Estimation: a Case Study
We study the interaction between input distributions, learning algorithms and finite sample sizes in the case of learning classification tasks. Focusing on the case of normal input distributions, we use statistical mechanics techniques to calculate the empirical and expected (or generalization) errors for several well-known algorithms learning the weights of a single-layer perceptron. In the case ...
Experts or an Ensemble? a Statistical Mechanics Perspective of Multiple Neural Network Approaches
In the framework of statistical physics, we studied 'ensemble learning' and the 'mixture of experts', which are the typical realizations of the multiple neural network approach. Generalization capabilities of the two methods are analyzed. We discuss the pros and cons of the two approaches, and the possibility of a unified method combining the merits of the two approaches.
Statistical mechanics on isoradial graphs
Isoradial graphs are a natural generalization of regular graphs which give, for many models of statistical mechanics, the right framework for studying models at criticality. In this survey paper, we first explain how isoradial graphs naturally arise in two approaches used by physicists: transfer matrices and conformal field theory. This leads us to the fact that isoradial graphs provide a natur...
Journal title:
- CoRR
Volume abs/1710.09553
Publication date 2017